HTML sanitization is the process of examining an HTML document and producing a new HTML document that preserves only whatever tags are designated "safe". HTML sanitization can be used to protect against cross-site scripting and SQL injection attacks by sanitizing any HTML code submitted by a user.
Tags often allowed are <b>, <i>, <u>, <em>, and <strong>.
In PHP this can be performed using the strip_tags()
function.[1]
In Java this can be achieved by using OWASP Java HTML Sanitizer Project [2]